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0 Human proinsuQn end analoga tharaof and method of preparation by mkroblal polypeptide espieaeion and conversion 
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0 Microbial «x or vision of a chimeric gtna is used to 
produce a polypeptide comprising the amino add s*quenee 
of human promsulin. or an analog thereof differing in the "C* 
chain portion. A polypeptide so produced contains a se¬ 
quence of additional amino acid unite sufficient in number to 
protect it from bacterial protease*, and haa a cleavage site 
e.g. a methionine residua adjacent the sequence of amino 
acid unite corresponding to the proinsulin or proinsulin 
enetoQ. Oeavege at this site (e.g. by CNBr) generates 
protnsulm (or the analog) which is treated in vitro to form the 
disulfide bonds between the ~A~ and "B~ chain proteins 
characteristic of human insulin. The **C~ chain portion it then 
excised enzymatically to yield human insulin useful e.g. in 
the treatment of diabetes. 

The chimeric gene may be synthesised from oligonu¬ 
cleotides and inserted into a plasmid which la used to 
transform a host call. e.g. C, coti. 
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HUNAN PR0IHSUL1N AND ANALOGS THEREOF 
ANO METHOD OF PREPARATION BY MICROBIAL 
POLTPEPTIOC EXPRESSION ANO 
CONVERSION THEREOF TO HUMAN INSULIN 

t 

r«i md Applications 

Thl» applIcation I* related to and Incorporates by 
rtferenco the disclosures of European Pftent Application 
Publications Not. 0001930 (A.N. 7B300S97.8) and 0014776 
(A.N. 81301227.5). 

Field of the Invention 

This Invention relates to nicroblol espression of 
polypeptide*. In one «»pect. It relete* to the preparation of 
gene* for the oicroblally enpresslble production of 
Interoediatet useful In the preparation of huoan in*ul1n. In 
another aspect. It relate* to the preparation of huoan 
proinsulin or analog* thereof differing froo huoan prolntulln 
In the.'C* chain portion. In yet another aspect. It relate* to 
the preparation of huoan Insulin froo the prepared huoan 
proinsulin or an analog thereof. 
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Oiabetes, tilt Hum* condition ehereeterUed by t ftiltrt of 
tbt pancreas tb generate tftt polypeptide hornone in 

sufficient quantitits, In sever. e«tt> •» »to»t. 1* eorr.ntly 
treated by injection of insulin derived fro* th. ponertos of 
slaughtered ani.als. •«*««• Porcino Insulin, in 

particular, aro uaod for tbi* purpoao. 

Th . ytt of insulin derived fro. ani.als is unsatisfactory 

fro. at least two standpoints. In tbt first placo, tbo 
extraction of insulin fro. the pancroas of slaughtered ani.als 
is a eo.plox process that requires larpe quantities of the 
orpans. Secondly, and .ore i.portantly fro. the diabetic^ 
point of view, the insulin derived fro. ani.al sources is not 
eite.ically identical to hu.an insulin, differing in the 

of peptide units. Furtheroore. it so-etl... contain. 
„on.ho.olo,o». *»i.*I hor.ooes. such as th. corresponding 
proinsuiin, albeit in s.all quantities. As a result, the 
response of patients treated with ani.al derived insulin is not 
ii tati.factory as desired. For ....pie, an i«««. response to 
ani.al insulin is believed to be a source of chronic 
co.plications in certain treatments of diabetes. 

Accordingly. there has gone unfilled a long felt need to 
have a source of insulin identical che.ically to hu.an insulin, 
unconta.inated by other biologically active i.purities. in 
••ounts sufficient to per.it diabetics to be treated 
econo.ic.lly. Co.pHc.tlng this task is the co.ple. che.ic.l 
structure of hu.an insulin. Structurally it has two 
polypeptide chains referred to as th. ’A* and *B* chain, bound 
t, each other by disulfide bonds. The A chain, so.. 21 
acid units in length, is bound (crosslinked) to the B chain, a 
chain of JO a.i«o acids, through disulfide bonds between units 
of the a.ino acid cysteine in each chain. 
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To bo aaxlaally effective U huaans, the iiIao odd udU 
of iniuHn aust be precisely ordtrod to correspond to thot 
produced 1o vivo. However, tho coaplexlty of tho aolecute It 
»«ch thot conventional aothodt of chealcal synthesis oro 
unsulted to Its ^preparation, on o eoaaerclal scale ot 1ea*t. 

Insulin ft producod jjn tho In tho piAcroot 1 a tho fora of 
preproinsulin. Preprolnsulla It 0 polypeptide coaprlsiag tho 
21 units of tho A chain, tho 30 units of tho I chain, a 
bridging or connecting chain of 35 units roforrod to as tho C 
chain and a 2* aalno add •presequence" (Not Ala Lou Trp Hot 
Arg leu Lou Pro leu Lou Ala leu Leu Ala Lou Trp Gly Pro Asp Pro 
Ala Ala Ala) attached to tho H*tera1na1 phenylalanine aalno 
add beginning the B chain. Prolnsultn, lacking the 
presequence Is shown In Figure 1 with a aethionine aalno add 
in place of the pretequence. This presequence nay participate 
in secretion froa the cells In which It It produced. At tho 
preprointulIn Is excreted froa the Islet cells on the pancreas, 
the pretequence Is excited to leave the prolnsultn chain. Thlt 
chain folds to a structure In which three disulfide bonds are 
foraed, two of which are between the A and B chain segaents of 
the prolnsulln. The connecting C chain Is then excised 
proteolytlcally to leave a residue which Is Insulin, consisting 
of the A and B chains bound together by tho disulfide bonds. 

This application describes a aethod for obtaining huaan 
Insulin and huaan prolnsultn and analogs thereof which differ 
froa huaan prolnsultn In the sequence of aalno adds asking up 
the C chain. The aethod utilizes the burgeoning recoablnant 
ONA technology. The following discussion of eleaents of the 
technology provide background to the detailed description of 
the Invention. 

With the advent of recoablnant OKA technology, the 
controlled bacterial production of useful polypeptides has 
becooe possible. Already in hand are bacteria aodified by this 
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technotogy to permit to* production of such polypeptide 
products ot somatostatin (*• Itokurp *t aU. $«!•««% 124* l0$ * 
(l*»n. *M (component) A ond • Chains of human Insulin (0.*. 
Cooddol 11 £».. Proc. hat'I . Acad. >cl, USA 76. IN ( l97, *> •"* 
huoon growth hormone (O.U. Cooddol ot aJ.. Ooturo 241 • * 44 
( 1979 )). Such It tho power of the technology thet virtually 
,ny g „f„i polypeptide ney he bacterlally produced, putting 
• Ithln retch the controlled ntnufteture of hormones, emynet. 
antibodies, tnd .acclnes useful tgtinst t wide .tnety of 
dlsetses. The cited ntterltls. which describe in gretter 
dettll the representative eaanples refer.ed to above. tre 
Incorportted herein by reference, ts tre other publlcttlont 
referred to Infra , to lllunlntte the background of the 
invention. 

The -ork horse of reconblntnt OnA technology Is the 
plesnld. tn eatra-chronosonal loop of double-stranded 0»A found 
in btcterlt. oftentlnes In nultlple copies per btcterltl cell. 
Included m the Infornttlon encoded In the plasmid 0«A Is that 
required to reproduce the plesnld In deughter cells (l.e.. t 
•repllcon’) tnd ordlntrlly. one or nore selection 
ehtrtcterlstlcs. such ts retlsttnce to mtlblotlcs. -itch 
perolt clones of the host cell eonttlnlng the pltsnid of 
interest to be recognised end preferentItlly grown *" »«’«et1*e 
■edit. The utility of pltsnlds. which ctn be reCbwered tnd 
isolated fron the host nlcroorgtnlsn. lies in the feet thtt 
they ctn be specifictlly cleared by one or another restriction 
endonuclease or •restriction ensyne*. eteh of which recognites 
, different site on the pltsnidlc DHA. Thereafter heterologous 
genes or gene fragments nay be Inserted Into the *lasnid by 
endwise joining at the cleavage site or at reconstructed ends 
adjacent the cleave site. 
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At uted herein, the tore ‘heterologous* refers to • font 
oot ordinarily found In, or o polypeptide sequence ordinarily 
•ot produced by. tbo host atcroorgonlsa whereas tk* tora 
*boaologout* rofori to • gene or polypeptide which It productd 
In tho bott aleroorioolta, tocb ae»C. coH. ONA rtcoablootlon 
It perforaed outtldo tho oicroergenlsat bot tbo rotoHlap 
•recoaolnent* plttald con bo Introduced into alcroorgonitat by 
o procott known ot trontforaotfon ond forgo quantities of tho 
htttrologowt gene-containing rocoabinont plotaid obtoinod by 
growing tho trontforaont. Moreover, whoro tbo gont it properly 
intortod with roforonco to portiont of tbo plotaid which govern 
the tronteription ond trontlotion of tbo encoded DHA actsogc. 
tho rctulting plotaid or ‘enpression -ibide*, when 
incorporotcd into tho bott aicroorgonita. diroett tbo 
production of tho polypeptide tequence for which the interted 


Exprettion it initlotod in o region known ot tho promoter 
which it recogniaed by ond bound by RHA polyaerote. In toao 
cotot. ot in tho trp operon dftcutted infro . proaotcr 
rogiont ore overlopped by ‘operator* regiont to fora o coabined 
proaotor-operotor. Operotort ore ONA tequencet which ore 
recognitod by to-colled roprettor protcint which terve to 
regulotc the frequency of tronteription initiation ot o 
particular proaotcr. Tho polyaerote trovolt olong the DMA. 
trantcribing the inforaotion contained in the coding itrond 
froa it* S' to J* end into aettenger ANA which it in turn 
trontlotod into o polypeptide having the aaino acid tequence 
for which the ONA codet. Coen oaino acid it encoded by o 
unique nucleotide triplet or *codon* within what aoy for 
pretent purpotet be referred to ot the ’structural gene*, i.e. 
that port which encodet the oaino acid tequence of the 
etpretted produet. After binding to the proaoter. the ANA 
polyaerote Mrtt trantcribes nueleotidet encoding a ribotooc 
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blndlng site. the* a translation Initiation or # itirt § signal 
(ordinarily ATO, which In the resulting messenger RRA becomes 
AUC), then the nucleotide codons within tht structural Ron* 
Itself. So-called stop codons ere transcribed it tho tnd of 
tht structure! goat whereafter tht polymerase may form «n 
additional sequence of messenger MA which, btesutt of tht 
prtstnet of tht stop signal, win remain uatranslattd by tht 
ribosomes• Ribosomes bind to tht binding sit# provided on tht 
messenger RKA, In bacttrla ordinarily as tht mRRA Is btlng 
foratd, and themselves product tht tncodtd polypeptide, 
beginning at tht translation start signal and ending at the 
previously acntlontd stop signal. The dtslrtd product is 
produced if the sequences encoding tht rlbosoat binding site 
are positioned properly with respect to tht AUC Initiator codon 
and if all rtaalnlng codons follow tht Initiator codon in 
phase. The resulting product aay be obtained by lysing the 
host cell and recovering the product by appropriate 
purification fron other alcroorganisa protein. 

Polypeptides expressed through the use of rtcoabinant DMA 
technology aay be entirely heterologous, as in tht cast of the 
direct expression of huaan growth horaone, or alternatively aay 
coaprlse a heterologous polypeptide and, fused thereto, at 
least a portion of the aaino acid sequence of a homologous 
peptide, as in the cast of the production of intermediates for 
somatostatin and the components of huaan Insulin. In the 
latter cases, for example, the fused hoaologous polypeptide 
comprised a portion of the amino acid sequence for beta 
9 alactos 1 dase. In those cases, the Intended bioactive product 
is bioinactivated by the fused, homologous polypeptide until 
the latter is cleaved away In an extracellular environment* 
fusion proteins like those Just mentioned can be designed so as 
to permit highly specific cleavage of the precursor protein 
from the intended product, as by the action of cyanogen bromide 
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o» HtMonlM. or alternatively by 

«$.. 6.1. Fatent Fubllcatio" !*o. J »♦ 

Hu.an bas hitherto boon obtolotd employing 

techniques of reco.btn.ot OHA technology. The process otod o 
synthetic 9««o for to. A chain -hich Is o.pr.ssod i« £- £*li 
..d . separate synthetic gen. for the 9 chain -Men I. 

expressed t« enoth.r E. con. ^ ^ 

x,ad. id.. «SA^ 76, 106 (1979). The two chains ort obt.ln.d 

„ ehlneric polypeptides (prot.lns) co.prising tho doslrod 
of a.ino .elds (olth.r the A or 9 ch.la ,.,«.nc.) 
peuad to ."other s.ctloa of carrier polypeptide desl9"«b « 
protect tne desired sequence fro. prote.ses In the E. eoH. 

The chi.ertc proteins V,v„ s selective cleavage *«te .dj.c.nt 
the desired polypeptide sequence of the A or 9 ch.ln which 
per.its separation of the desired sequence fro. the carrier 
polypeptide. Isolation of the t-o sequences Is followed by the 

formation in vitro of the disulfide bonds. 

This process is necessarily co.plie.ted by the fact that 
t.o distinct genetically .edified bacterial strains .ust be 
obtained and .aintained. Further, the prior process requires 
separate isolation of the A and 9 chain and the cro,slinking of 

the t-o chains by ...«» of ' — ‘ion •' aU#,Mde ^ 
-ithout the aid in orientation provided by an intact C chain. 

The present application describes a process for the 
construction of a single gene to express. In a single 
■icroorganisa. a chi.eric protein -hich Includes a co.plet. 
hu.an proinsulin polypeptide or an analog thereof differing 
fro. hu.an insulin in only the a.ino acid sequence of the C 
chain. Hu.an insulin can be cleanly eacised fro. these 
polypeptides after in vitro for.ation of the disulfide 
crosslinks between the A and 9 chains. 









0055945 


- 8 - 


1 , tbit pnctit. prtiitiUi, u< can »« 

dlrcctl? obtained !• pare for* bad free » f 

biologically active I-P«rlt1.». Sl.llarly. tb. prol.i.Ho. coa 
b. off.ctlr.1y processed to obtain I...I1. che.ic.lty ld.otlc.1 
to b-.an latolia ol.llorly fr«. of biologically 

i.porltlo* tbu» profiting • 
tbaa possible «i«9 


t„—.ft of the Invention 

Ta. present invention provides a -ethod for obtaining bo... 
b, naans of a chin.rlc ***«**• e^ioing the poIyw»tide 
M q uence oi lu»» proinsulin, or an analog thereof differing U<m the 
polypeptide sequence or hu-. proin.ulin in the 
acid, coopt ining the C. chain. fused to additional protein or 
protein frag-ent. there being a selective cl.av.,. .it. -bich 
peroits cleavage of the proin.ulin or it. analog fro. the addition, 
protein or frag-ent. The cleaved ptoinwlin product «ny then be 
caused to orient by foroation of the characteri.tic i».uli« A 
,„d B Chain disulfide crosslink, and the cro..link«d inwlin 
' precursor -ay then be excised fro- the C’ chnin carrier. 

The -C- chain {or, hereinafter, bridging chain) of aofno 
,cid -nits of the analog pr.in.olins o.d. according to the 
present invention -ay co.prise as few as l a.i»o acid -nits. 

The Identity and s.g-enc. of ..in. acid -nits i.t.ro.di.t, the . 
ends of the bridging chains in analogs of h—n proi.s-lin are 
not particularly significant. Hovever. the end cits thereof 
.ost be -nits -bich per.lt facile eacision of the bridging 
chain fro. the A and 8 chains of huwn ins-1 in. Preferably, 
eacision occ.rs after the proin.-lin .ol.cule ha. been cleaved 
fro. the addition protein of the chio.ric polypeptide and. -ost 
preferably, after the thus cleaved prolnsulin .bled, has been 
folded and the disulfide lints bet-een the A and « chains 
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characteristic of huaen Insulin have boon formed. 

Preferably. ths bridging chain has sitos which permit Its 
excision by enxymatic means. Preferred for this purpose 
oco Arg-Ar 9 and lys-Arg units on tho bridging chain which art 
adjacent the terminal -COQH end of the I chain and teralnal 
•NH 2 end of the A chain, respectively, as is found In huaan 
proinsullo Itself. Also preferred are two Arg-Arg units 
between the B and A chains. 

The chiaerlc proteins of the present Invention are obtained 
by expression of a heterologous structural gene for the 
proinsulin or analog' In a recoablnant aicroblal cloning vehicle 
in which the geno U In reading phase with a ONA sequence 
coding for an addition protein portion of the chiaerlc protein 
and the cleavage site. i = preferred eabod'aents of the 
invention, the cleavage site is aethionine at the N-terainal of 
the proinsulin which peralts cleavage using cyanogen broaide. 

The additional protein can vary but the preferred 
additional protein is aethionine aaino acid or the presequence 
of preproinsulin or a portion thereof or a- 9 alactosidase 

or a substantial portion thereof or a portion of the aaino acid 
sequence encoded by a fragment of the trp leader polypeptide 
gene fused to a portion of the trp C polypeptide gene or the 
trp 0 polypeptide. The added, ultiaately superfluous portion 
of the chiaerlc protein is selected to provide protection froa 
bacterial proteases which night otherwise digest the proinsulin. 

The preferred recoablnant aicroblal cloning vehicle is a 
modified, preferably bacterial, plasaid containing the 
structural gene in a reading phase with the ONA sequence which 
codes for the added, superfluous portion of the chiaerlc 
polypeptide and the selective cleavage site. 

The aanner in *h1ch these and other objects and advantages 
of the invention are achieved will be apparent to those stilled 
in the art after consideration of the following description of 
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U 1 «str«t 1 ons of fig*. l-&« 


Brl«f Pttcflotlo o of tht Orgwlgfll 

M 9 vr« 1 show. the ooeleotldo soqoonce of • gone f#' 

profasullo* 

Ffgttrc 2 Illustrates a sche.e for obtaining a pl*s«« 
containing a fr.gft of the 9«»* •' 1 - Meh 4,r1 "‘ 

nsing reeerse transcription fro. ■««*. 

Figure J Illustrate, a sche.e for obtaining a plas.ld 
containing the gene of Figure 1 suitable for tr.nsfr.atlon 
Into, e.g.. C. coll for expression of hunan proinsulin. 

Figure a”r, lustrates the analysis by "PIC chro.atogr.phy of 
huoan proinsulin obtained by expression fro. an, e.g., £. eoH 

tr.nsfor.ant containing the pla.-ld derived fro. the sc. . 

Figure 3. 

Figure 5 Illustrates seg.ents of a gene for expression of 
on analog of hu.an prolnsulln differing fro. hu.an prolnsulln 
in the a.lno acid sequence of the C. bridging chain. 

Figure 6 illustrates a sehe.e for asse.bllng a pl.s.ld 
containing a gene for tr.nsfor.atlon into. e.g.. £. cod for 
expression of an analog of hu.an prolnsulln. 


Detailed Description 
A. Preparation of Hu.an Prolnsulln 

1. Preparation of Synthetic Gene Coding For The 32 
n.Ter.Inal A.!no Adds of Prolnsulln 
a. Oligonucleotide Synthesis 

A series of 18 oligonucleotides, short nucleotide 
chains 10-12 units In length shown in Table 1. -ere prepared a. 
a first step to the construction of a gene coding for the first 
32 a.lno adds of prolnsulln. the a.lno add sequence of which 
„ shown in Flg. 1 a. a part of the nucleotide sequence of the 
entire gene ultimately constructed for u»e in the present- 
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Uveatfoa far tit «a*ra»*loa of liua 'rofatalfa. Tba 
UdUUeal aaeWotfdas fa tte «aaa art Idaatfffad by tba 
lattcrs >. T, C or 6 r«yr«statla# tba. bases aderfee. tbyafae. 


uoUtr. 

TABIC 1 


goovdtotf 4cs: 

HI 

H2 

H3 

HI 

H$ 

Ht 
H7 
MS 
11 
12 
S3 
84 
B$ * 
Bt 
•7 
BB 
•9 
BIO 


aattcatctt 

CCTCAATCAGCA 

CCTTTCTCGTTC 

TCACCTCCTTGA 

TTGACCAACATC 

CAAAGCTGCTCA 

AOCTGAGAACCA 

agcttcaacg 

acctttgtac 

cttgtttgcggt 

CAACG1GGTTTC 

TTCTACACTCCT 

AAGACTCGCC 

aacaaggtacaa 

ACGTTCACCCCA 

GTAGAAGAAACC 

ACTCTTAGCACT 

GATCCCGCG 
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Tki tyitkttlc Hcltttlket are »ko« k«t»«* 

brackets in T ig. 1. These oligonucleotides were 
ijttktiUck by tbs triester setbod: I. Crr« et il.., >r»c. Bat. 

A«■«*. Scl.. OS*. 7S. STBS (1978), *. Itakora et eK. J. 9iol. 
On. , ISO. dS97 (197$) u( I. Itaksra et oK. J. Ao. C*ies. 

Sec.. 97. 7327 (197$). of these are el igooucleotides 

which were seed I* a gen* coding for the 9 chain of honoo 
iotolia previously described by Croa et aK. Proc. Sat. Acad. 
Sci.. 0$*. 75. S7<5 (197*).aad Coeddel ft {].. Proc. Oat . *cad. 
Sci.. PS*. 7*. 106 (1979). The nucleotide sequence* of two 
synthetic nucleotides (OS* and 810*). »*re synthesized for this 
project; tn« other; were prepared according to Crea K eK 
(supra.) The two new ollgooucle;ride*. a)*o prepared according 
to Crea et aK. incorporate restriction enzyse recognition 
site* for Mpa H and tersina) BaoHt, the latter used for 
cloning. The other end of the gene contains a sticky end of an 
fcoBI tit* for cloning purposes. 

0 . Joining of Synthetic Oligonucleotide* 

The eight oligonucleotide* H1*H8 were used 
previously to construct the left half of the 6 chain gene. This 
was used in tms process ano is oescribeo by Coedoei et *1.. 
Proc. Mat! . Acad. Sci. USA 76. IOC (1979). It contains the 
codons for the 1*13 anino acid* of the 8 chain gene and a 
oethtoaine unit at the ■-teruinal, used later t*> cleave the 
proinsu)in froo bacterially espressed chiseric protein using 
cyanogen orouide (CMBr). 

The right half of the 8 chain gen* was obtained 
froa the oligonucleotides Bj. Bj, Bj. B 4 . Bj*. Bj, 

B 7 . Bj. B, and Bj 0 *. by ligation using T 4 ligase ano 
a technique described by Coeddel et *K (supra). The gen* 
fragsent produced codes for the l«-30 anino acid units of the I 
chain and the first unit. Arg, of the bridging chain. 
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Iocorporated iato lb* gen* sequeoce I* *n Hpall restriction 
enxyae till i* til* *••« reading fraae and location at an Hgall 
lid |a the huaan Insulin gene. After purification of the 
ligated geo* fragoent by polyacrylaalde gel electrophoretl*. 
and elution of th* largest OMA bead, the fragaent was inse-ted 
Into the plasaid pBR322 that bad been cleaved with restriction 
endonucleases Hindi !! and BaaM I. thereby utlllxlng the Hlnd tll 
and BaaMI sites on the synthetic geae fragaent. Th* OMA was 
Inserted Into E. coH 294 (ATCC Mo. 3144S) by transforaation. 

One plasaid. pB3* recovered froa an aopicilltn resistant, 
tetracycline sensitive clone was found to possess the desired 
nucleotide sequence according to the aethod of A.H. Maxaa 
et iK Proc. Matl. Acad. Set USA 74 560 (1977). 

Froa the two plasaids pBH I and p33*. two OMA 
fragaents were recovered, a 46 base pair EcoRI to Hindlil 
fregaent froa pBHl, and a 58 base pair HindHI to BaaMI 
fragaent froa pBJ*. 

The two fragaents were ligated together to produce 
a fragaent having an EcoRI site and a BaaM I site. This 
fragaent was Inserted in plasaid pBR322 which had been treated 
with EcoRI and BajHI restriction endonucleases using the aethod 
described in Coeddel et al_. Proc. Bat. Acad. Sci.. 03A . 7S, 106 
(1979) and cloned in E. eol_^ K-12 strain 294 (ATCC Mo. 31446) 
to provide the plasaid p!B3. After cloning, the plasaid pIBJ 
was cleaved with Eco RI and Mpa l! restriction endonucleases to 
recover the synthetic gene fragaent (fragaent I. Figure 3) 
containing the codons for the N-teroinal proinsulin aaino acids 
preceded by a aethionine codon as shown in Figs. 1 and 3. The 
synthetic gene was isolated by polyacrylaaide gel 
electrophoresls • 

2. Isolation of A cONA Gene Coding For the 55 C-Ter»in*l 
Aaino Acids of Muaan Proinsulin 
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To, tent®* far obtain*** tn. eONA goo. it sch..atlc*11y 
show a la n$* 2. 

A d.caouct.otid. »« syotb.six.d containing tbt 
recognition sequence far Bart! endonucleases to abieb *** ad * 

« 3* polythyaidylie aeld tract of approsiaately 20 rttidoot. 

Itt sequence it pCC66ATCCCCTT l# T. Tbit oligonuclootid. ®*t 
astd to pri®. AH* reverse trtntcriptatt for cOMA tyntbttit. 

Tbt prlatr was pr.par.d using t.ralnal d.osynucleotldyl 
transferase (Ento Bioebt®. 200 onitt) with ant p®olt of tb. 
a ,.h i dtcanucitotidt i» a r.aetion eoluae of 0.« ®l containing 
l.S * 10' 4 TTP. Tbt reaction was conducted at 37*C for one 
hour in • buffer tyttea dttcrlbed by A. Chang et aU Matu re. 

27S . 617 (1978). 

Hunan Intullnoaa polyA tittut (2.S vg) proeldtd by the 
Inttftute fuer Oiabetttfortebung. Mutncbtn. West 6er®any 
(Dr. Wolfgang Ke®mler) containing ®RMA itolattd by tb. procttt 
of Ullrich tt al. Sd.net . 196, 1313 (1977) was cone.rt.d to 
double ttranded cOMA by a proc.dur. according to Wlek.nt •» •!• 
j. aiol. Chen. 2S3. 2483 (1978). That. 80 .1 containing IS ®M 
Trlt/HCl (pH 8.3 at 42*0. 21 ®H KC1 . 8®H HgClj. 30 ®H 
i-uercaptoetnanot , 2 ®H of the primer dCCG5ATCCGC77 l8 T. aod 1 
■H dMTPt was preincubat.d at 0*C. Then 40 unitt of AH* reeerte 
trantcrlptase -ere added and the alature incubated for IS 
■invites «t *2 C. 

The complementary cOHA ttrand oat tynthetiied In a 
volume of ISO ■! containing 25 mH Trit/HCl (pH e.3), 3S mH KC1, 
4 nH HgClj. IS mM t-mercaptoethanol and 9 units of DMA 
polymerase l (Klenow fragment). The mUtur. wat Incubated at 
for 90 minutet followed by 15 hours at 4 C. SI nucleate 
digestion wat then performed for 2 hours at 37*C using 1000 
units oi >1 nucl.as. (Hlles laboratories) as described by 
uickens et aK supra . The double stranded cOMA (0.37 pg) was 
subjected to tlectrophor.tlt on an 8 percent polyacrylamide 
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9*1• OMA frigstAts larger than $00 b«$t pairs «trt eluted. 

01 igodeoxycytldylic add residues were added to the 3* ends of 
the fragments using teraioal deoxyaucleotidyl transferase by 
the procedure of J.f. Halxel Jr.. Hath. Tirol. . $. 180 (1971). 
The dC tailed COMA fragaents were annealed to pBR322 that had 
been cleared with the restriction endonuclease Pstl and tailed 
with deosyguanldylic acid using terainal deoxyaucleotidyl 
transferase. The resulting plasaids were traasforaed into 
C. coll K-a2 strain 294 and cloned. Colonies resistant to 
tetracycline but sensitive to eapfclllln were isolated and 
screened for plasaids having three sites cleavable by the 
restriction endonuclease Pst l indicative of the presence of the 
gene for insulin. Sures rt aK Science , 208 . S7 (1980). 

One plasaid. pHIlOd, containing a 600 base pair Insert 
and giving the anticipated Pst l restriction pattern was 
deterained to contain a site cleavable by BaaH l between the 3* 
polyA and the polyCC introduced during the cDUA preparation. 
Soae of the nucleotide sequence of the Insert Is shown In 
Fig. 1. This sequence differs slightly froa that previously 
reported by I. Sures et aK Science . 208 . 57 (1980) and G. 

8ell, e,t aK Mature . 282 . 525 (1979). having an AT base pair 
where underlined rather than a CG pair, because the aRMA used 
was froa tissue isolated froa a different individual. The 
resistance to antibiotics conferred on a bacteriua by this 
plasaid is Indicated by the aarker Ap S for aapicillin 
sensitivity and Tc r for tetracycline resistance. 

3. Asseably Of A Gene Coding For Huaan Proinsulin 

The scheae used for asseabllng a gene coding for huaan 
proinsulin is shown in Fig. 3. 

The synthetic gene segaent coding for the first 31 
aalno acids of proinsulin, fragaent 1 in Fig. 3. was recovered 
froa 50 »9 of the plasaid plB3 using the restriction endo¬ 
nucleases Cco Rl and Hpa l1 as described above. This fragaent 




0055945 


- 16 - 


contains We codoa ATC for aethlcalae la place of Wo 
•presequence' of preprotasulIn. Introduction of o aethioolae 
aait »t this pofat peralts the polypeptide eltiaotely expressed 
to oc cleaved at Wit polot by cyoaopeo broalde (C«»r» to 
separate We pralotglla froa We retldoe of We polypeptide 
which served to protect the prelatalla pertloo frea baeterlal 
proteases. 

The COMA peoe sepaent coding for aalao acids 32-66. os 
.ell as the traaslatioo stop eodoas sad We 3’ oatraaslated 
repior. of the oRMA was recovered froa *0 v9 of <•»« p1«**1d 
pH!10* by treatoeat first with BaaHI aad thea H£al! as sho-a la 

Fi;. 3 as fragoent 2. 

The two fragnents were Isolated by polyacrylamide 
electrophoresis followed by electroelution. The peae frapoeats 
were joined by treataeat with T« DMA lipase ia 20 .1 lipase 
boffer (Goeddel et aK Proc. Mat. A rad. Pel.. PSA. 76. 106 
( 1573 ) , t a*C for 20 hrs. The aixture was diluted with 50 vl 
HjO. extracted with pheaol. thea chlorofora aad thea 
precipitated with ethanol. 

The resultiap OMA was treated -1th B«HI aad EcoRl to 
repenerate these sites aad reaove pens polyaers. The asseabled 
proinsulla peae -as Isolated by polyacrylaaide pel electro¬ 
phoresis and Upated usiap TO lipase to the plas.id pBS322 
which had previously been treated with EcoAl aad BaoHl. The 
resultinp OMA was tr.osfor.ed late t. coU R-12 strain 290 and 
cloned. Colonies -ere screened usiap the plasaid conferred 
antibiotic resistance barkers.' The desired clones were tetra¬ 
cycline-sensitive (Tc*) aad aaplclllIn-reslstaat (Ap r ). 

Plisaid p«I3 was Isolated froa one such colony and the 
proinsulin was ch.r.cterlted by nucleotide sequence analysis 
and found to have the sequence shown In Flp. 1. 

4. Construction of a Plasaid Oesipned to express a 
r.nioer ic Protein Containinp the Huaan Proinsulin Peptide 
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Plasaid pBRHl, (I.I. Rodriguez. «t al_.. Muclele Acid* 
Research C, 3257-32*7 (1979) expresses aapicillin resistance and 
contains the gene for tetracycline resistance but, Were being 
no associated proaoter, does not eapress that resistance. The 
piasaid is accordingly tetracycline sensitive. *y introducing 
• proaoter-operator systca in the EcoRl site, the piasaid can 
be aade tetracycline resistant. 

Piasaid pGM carries the t. coli tryptophan operon 
containing the deletion LE14I| 16.F. Ntozzari, et aK, (197SJ 
J. Racterlology 14S7-1466)) and hence ezpresses a fusion 
protein coaprising the first 6 aaino acids of the trp leader 
and approxiaately the last third of the trp E polypeptide 
(hereinafter referred to in conjunction as LE*1. as well as the 
trp D polypeptide in its entirety, all under the control of the 
trp proaoter-operator systea. The piasaid, 20 «g, vas digested 
with the restriction enzyae Pvull which cleaves the piasaid at 
five sites. The. gene fragaents 2 were ne*t coabined with EcoRl 
linkers (consisting of a self coapleaentary oligonucleotide 3 
of the sequence: pCATGAATTCATG) providing an EcoRl cleavage 
site for a later cloning into a piasaid containing an EcoRl 
Site. The 20 «g of OMA fragaents 2 obtained froa pGKl were 
treated with 10 units of T 4 ONA ligase in the presence of 200 
pica aoles of the S'-phosphorylated synthetic oligonucleotide 
pCATGAATTCATG and in 20 »1 T t OMA ligase buffer (20aM trls. 
pH 7.6. 0.5 oM ATP. 10 oH HgClj. SaM di thiothreitol) at 4*C 
overnight. The solution was then heated 10 ainutes at 70*C to 
halt ligation. The linkers were cleaved by EcoRl digestion and 
the fragaents. now with EcoRl ends were separated using 5 
percent polyacrylaaide gel electrophoresis (herein after 
•PAGE*) and the three largest fragaents isolated froa the gel 
by first staining with ethidiua broaide. locating the fragaents 
xith ultraviolet light, and cutting froa the gel the portions 
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of interest. Each gel fragment. «»U 300 mlcroliters O.UTBE. 

«tt placed in a dialysis bag *»d subjected t* electrophoresis 
«t 100 v for on* hoof U O.laTBE boffor (TBE boffor contains: 

10.8 ]■ trip base, 5.5 9* boric acid, 0.09 9* BajEOTA lo 1 
liter HjO). The aqueous solutloo was collected fro* the 
dialysis bap. phenol eatracted. chloroform eatracted and oade 
0.2 M KaCl. and the 0»A removed lo MjO after CtOH 
precipitation. 

pBRMl was digested with EceBl and the entyoe reoo.ed by 
pnenol eatraction followed by chloroforo eatraction and 
recovered in water after ethanol precipitation. The resulting 
Dna aole^ule was. In separate reaction oiatures, coobined with 
each of the three ORA '.'agments obtained above and ligated with 
T t OKA ligase as previously described. The OKA present in 
the reaction mixture was used to tranform competent E. coll. 
c-12 strain 29«, R. Backoan et C.. Ptoc Kafl Acad Sci USA 73. 
417C-4198 [1976]) (ATCC no. 31««6) by standard techniques (V. 
hershfield et C.. Proc Kafl Acad Sci USA 71. 3«SS-3«S9 
[1974]) and the bacteria plated on IB plates containing 
20 „g/o1 aapicillin and S *g/ol tetracycline. Several 
tetracycline-resistant colonies were selected, plasoid OKA 
isolated and the presence of the desired fraguent confirmed by 
restriction entyoe analysts. The resulting plasoid was 
designated pBRHtrp. 

pBAH trp was digested »ih EcoRl restriction entyoe and 
the resulting fragment isolated by PACE and electrocution. 
EcoRt-digested plasoid pSOHall (R. Itakura et C.. Scien ce 198 
10S6 (1977); 6.B. patent publication no. 2 007 676 A) was 
coobined with this fragment. The mixture was ligated with T 4 
OnA ligase as previously described and the resulting OKA 
transformed into E. eoM R-12 strain 29« as previously 
described. Transformant bacteria were selected on 
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eapicilHh*eootalolng plitii. Resulting aapiclllin-resistant 
colonies ware screened by colony hybridization (N. Cruenstein 
at al... *roe HatM Acad Sel USA' 72. 3951-39(5 C1975J) using at 
a proba tha trp proaoter-operator-contalnlng fragaent fselatad 
frea pBANtrp, which had baaa radloaetively laballad with 
9 3 *. Several coloala* shown positive by coloay hybrid! zatlor 
wara selected. plasald DMA was isolated aad tha orlantatloa of 
tha lasartad fragatats dataraiaad by rastrlctloa analysis 
•■ploying restriction aaxyaas Bgl 11 aad SaaHI la double diges¬ 
tion. C. coll 294 containing the plasald designated pS0H7a2. 

Plasaid pBR322 was HlndMI digested and the protruding 
Hlndlll ends In rn digested with SI nuclease. The SI 
nuclease digestion involved treatnent of 10 »g of 
Hindi 11-clcaved p8R322 In 30 .1 >1 buffer (0.3 M HaCl. 1 an 
ZnClj, 2S oH sodiua acetate, pH 4.5) with 300 units SI 
nuclease for 30 ainutes at iS*C. The reaction was stopped by 
the addition of 1 el of 30 X SI nuclease stop solution (0.8H 
tris base. SO an COTA). The aixture was phenol extracted, 
chiorofora extracted and ethanol precipitated), then EcoRl 
digested as previously dese-lbed and the large fragaent 1* 
obtained by PACE procedure followed by electroelution. Tne 
fragaent obtained has a first EcoRl sticky end and a second. 
Olunt end wnose coding strand begins with the nucleotide 
thyai dine. 

16 *9 pSOH7p2 was diluted into 200 *1 of buffer 

containing 20 nH Tris. pH 7.S, 5 oH HgClj. 0.02 percent NP40 
detergent. 100 oH HaCl and treated with 0.5 units EcoRl. After 
15 ainutes at 37*C, the reaction aixture was phenol extracted, 
chlorofora extracted and ethanol precipitated and subsequently 
digested with Bgl II. The larger resulting fragaent 3* was 
isolated by the PACE procedure followed by electroelution. 

This fragaent contains the eodons *lE'(p)* for the proilae! end 
of the lE‘ polypestide. i.e., those unstreaa froa the Bgl 11 



0055945 


-20 


sit*. The fragaeat 3* was “•*« Hg*«e4 to the frogaeat «*. 
Fragaeat <* It prepare* by successive digestion of pTrp24 
(prepared open EcoBI digest of pThal flothealttry 80. 6096 
(1980)1 followed by Kltaow polyoeroto 1 rooetloa to bloat the 
CcoRI reslduet. Igl II digestioneceaeea a linear fragaeat which 
-a, recircolarlxed by roactloa with the If containing Bgl II 
sticky and bloat ends) with Bglll and EcoBI. followed by FACE 
<nd eleetroelutloo. The ligation was done la the presence of 
ooA 11 gate to fora the plasald pS0M7a2a4, which was 
transfers.* Into E. coH strain 294. as previously described. 

Masald P S0M7a2 was Bgl II digested and the Bgl II 
sticky ends resulting aade double stranded with the Klenow 
polyaerast I procedure using all four deoaynucleotlde 
triphosphates. EcoBI cleavage of the resulting produet 
followed by PACE and electroelutton of the snail fragnent 2* 
yielded a linear piece of DMA containing the tryptophan 
prosoter-operator and codons of the IE’ •proalnal* sequence 
op, trees fro. the Bgl II site CUE*(p)*>. The product had an 
EcoRI end and a blunt end resulting froa filling In the Bgl It 
site. However, the Bgl II site 1s reconstituted by ligation of 
the blunt end of the fragaent 2* to the blunt end of fragaent 
1*. Thus, the two fragsents were ligated In the presence of 
T t DUA ligase to fora the reclreularlzed plasaid pHKT 10 
which was propagated by transfornatlon Into coapetent E. colj. 
strain 294 cells. Tetracycline resistant cells bearing the 
recosbinant plasnid pHKT 10 were grown up. plasaid OHA 
..tract.* and digested in turn with Bgl II and Pst followed by 
Isolation by the PACE procedure and electroelution of the Urge 
fragaent. a linear piece of OHA having Pst and Bgl II sticky 

ends to give OHA fragsent f . 

Plasaid p$OK7a2a* could be aanipulated to provide a 
second coaponent for a systen capable of receiving a wide 
varie.y of heterologous structural genes. The plasaid was 
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subjected to partial CcoRI digestion followed by Rst digestion 
end the fragments contelolng the trp proaoter/operator was 
isolttcd oy the RACE procedure followed by electrocution. 

Pertial EcoRI 'digestion was necessary to obtain a fragaent 
which was cleaved adjacent to the S‘ end of the soaatostattn 
gene hot not eleaved at the CcoRI site present between the 
aapiclltto resistance gene and the trp proaoter operator. 
Aaplcillln resistance lost by the Pst I cut In the ap # gene 
eea ld be restored upon ligation with fragaent £ . 

In a first deaonstration the third coaponent, a 
structural gene for soaatostatln {6*) was obtained and purified 
By PAGE and electroelution. 

The three gene fragaent. V. £* and 6' could now be 
ligated together In proper orientation, to fora the plasmid 
S0H7ala*. 

The coaplete huaan proinsulin gene. Including the 
K-terainal codons that code for aetbionfne.was recovered froa 
the plasaid pH!3 by treatment with CcoRI and SaaHl and purified 
by gel electrophoresis. This gene, fragaent 3 In fig. 3. was 
joined to two other DMA fragments with T« Ifgase; these are 
identified as fragaents * and S in fig. 3. 

Fragaent *' contains a proaoter and a carrier protein 
gene derived froa the plasaid pS0H7al&4 by partial digestion 
with CcoRI and coaplete digestion with Pst l. This fragaent 
contains an E. coli tryptophan (trp) proaoter-operator. nine 
codons froa the trp leader peptide. 190 codons froa the trp E 
gene and an Cco RI cleavage site introduced in place of the trp 
E termination codon. (This gene construction will bn referred 
to as trp if below.) The tryptophan attenuator region 
including the last S codons of the trp leader peptide sequence 
and the first two thirds of the trp E gene are deleted in this 


construction. 
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TUc trp £ gene (trp IE*). to»U<Md in Fragaent 4*. ** 
aodified to Incorporate i» [toll *lto !• ploco of the 
torat ootloo codon of the trp £ 9000 at shown to gi*o * h « 
correct reading fraae with the Inserted gene fragaent 3. 

This fragaent It bounded at tbe opposite end by a 
PttI site derived froa tbe p8«322 and Incorporate! tbe fint 
naif of tbe o-lactanase 9 ene. The fragaent vat recovered froa 
20 ,9 of platald pS0N7sla« by partial digestion with EcoBI 
followed by treataent with Fstl. The pronoter containing 
fra^ents were Isolated by polyacrylaalde gel electrophoresis- 

Fragaent S' was obtained froa plasaid pHKTIO. This 
plasald is a derivative of pSB322 and contains a tryptophan ' 
proaoter-operator in place r.' the tetracycline proaoter. The 
Hindlll site of p8B322 has been converted to a B^Ul site’. The 
plasaid. 20 * 9 . was treated wtlh Mt l and 8 ^ 1.11 and the large 
fragaent. designated 5 In Fig. 3. purified by polyacrylaalde 
gel electrophoresis. 

The two fragasnts 4 and 5 were ligated together to 
reasseable the gene for p-lactaa.se via a Fstl site and confer 
aapiclllin resistance (Ap r l. The ends then present an E£oBJ 
site and a 8*0.11 tile for insertion of a gene. These two sites 
cannot be ligated together due to nonhybridltation of the 3* 
protruding ends and can only be Joined by incorporating a OHA 
fragaent that possesses 3* ends coa^laaentary to the EcoBI and 
BjO. 11 ends. The proinsulin gene, fragaent 3 containing EcoBI 
and BaoMI ends, is such a aolecule. Thus, the three fragaents. 
5 .g of 4 . 1 .g of 3 and 1 .g of 5 were coablned and treated 
with 14 ONA 11 gate at 4*C for 24 hours in ligate buffer. Upon 
circularization to close the plasaid. the tryptophan proaoter- 
operator controls eipression of a fusion (chiaerie) protein of 
which proinsulin is a portion. Tetracycline resistance <Tc ) 


is also conferred. 
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Xh# OHA Mixture froM tut ligation was transforoed in 
t. coli, K*12 strain 294 by the procedure »( Goeddel et oK . 

■eture. 781* 541 < X9,9, ‘ C 01 ®" 14 * *•»«««< th4t 

9 rew •« both ampicillio end tetrecycline. Of 3 colonies 
tested. 2 were found by SOS polyacryl soldo pel electrophoresis 
CJ.F. Mallei. Jr., noth. Yirol. . S. 180 (1971)) to express . 
protein of the Molecular weight expected of the trp IE’- pro- 
insulin fusion. One plesoid. pHI7 was coopletely characterized 
as to 9HA sequence of the incorporated gene and restriction 
analysis of the vector p8R322. 

5. Proinsulin Isolotion 

The plnsoid pH!7 wis transformed Into. E. coll K-12 
strain RV308 (ATCC Bo. 31608) and grown in SOO Ml of LB nedlun 
(j H. Hiller, Eioerinents In Molecular Genetics, *33. Cold 
Spring Harbor 1972) containing 10 .g/nl of aapicillin to a cell 
density of S 00. This was diluted into a 10 liter fermentation 
vessel (Hew Brunswick) and grown in H9 Media (Miller, supra. at 
431.) to a cell density of 14 00. Cells were collected by 
centrifugation and frozen. 

Cells (16*g) were thawed in S volumes sucrose lysis 
buffer (10 percent sucrose, O.lK tris HC1, pH 7.9, SOiiM EOTA, 
0.2M MaCl) containing 0.1mm phenyloethylsulfonyl fluoride and 
1.0 mH dioerceptopropana!. ar.fi lysefi by sor.1eaf.on. The .y».» 
pellet was collected by centrifugation and suspended by 
Stirring overnight at 4*C with 4 volumes of 7.OH guanidine-HCl. 
ImH dimercaptopropanol. ImH EOTA. After centrifugation the 
supernatant was diluted 20 times with cold water and allowed to 
stand 2 hours at «*C. The precipitate (9.6g dry) was collected 
by centrifugation and reacted overnight at room temperature in 
220 ml 88 percent formic acid with $g CH6r to cleave the 
proinsulin from the trp Lf fusion. After rotary evaporation 
the residue -as suspended in 200 al 7.5 h urea. ImH EOTA. 20oH 
ammonium tarsonate, and the pH adjusted to 9.0 with 
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(tbiooUilftt {5.5 *11• T®o sodiu* su • 

grtas of sodiua tetratbionate were added to.contort cysteines 

and cystines to S-*olfonato group* and the reaction ttlrred at 

rooa teoperature for * hour*. 

The reaction aiiture was desalted on a 6-25 Hediua 
coloan I. 7.SH urea. lOaN Trl* pH 8.5. 10H COT*. The de*.lted 
protein -a* loaded onto a OEAE-s.phade. (A-2S) coloan and 
eluted -1th a 2-liter linear gradient of 0 to O.S H MaCl in the 
taae trti-orea Puffer. The pro1n*u11n like notarial. 
identified Py RIA or HPLC (see below), was concentrated on an 
Aoicon YUS aeoorane and resolved in the tris-urea buffer nn a 
c-so Mediua coloan. The G-SO fractions, identified by HPLC. 
„ere pooled (10«al) an: the buffer charged on a coloan of 6-25 
Fine equilibrated with 30 aN aoaoniua carbonate. pH 8.8. The 
lyophilixed protein weighed 216 ag. The recoveries at eaeh 
step are shown In Table 2. 

6. Prolnsulln Analysis 

The $-sulfonated proinsulin obtained was analyzed by 
..ino acid analysis. This analysis was node by Eli Lilly and 
Co. and is shown in Table 3. 

TABLE 3 



Aaino Acids 
Cal cut a ted 

Asino Acids 
Predicted 


Aaino Acids 
Cal cul a ted 

Aoino i 
Predlc 

Asp 

4.40 

4 

lie 

1.34 

2 

Thr 

2.90 

3 

Leu 

12.21 

12 

Ser 

4.50 

5 

Tyr 

3.93 

4 

G1 u 

15.64 

15 

Phe 

2.61 

3 

Pro 

3.42 

3 

His 

2.02 

2 

dy 

11.08 

11 

Lys 

1.96 

2 

Alt 

4.46 

4 

Arg 

3.92 

4 

Cys 

2 .85 

6 

va 1 

5.58 

6 
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TB« pre*«nee of prointuHe we* *l*o eatHnU bp ndU- 
iMunoessey. To determine the redioii*.unoectivity tbo CoroUp 
l25 |.ii»Mi kit we* «ed. Th * entibody wo* found to b« • oo 
4 poreont (rtii-ructlM with proio*elln end *boot 0.2 poreont 
crotweectlve with rodocod prointulin. Unknown* were h«4t«d 2 
•inotes 4 t 90-C. in 7.5W or*.. 2 oN .-..re.pto.th.uol. pH O-XO 
(eth.nole.ine). end eltquot* diluted in photphete-buffered 
„,,ne (0.1 peletin) i..edietely «*«yed. The*, result* 

.ere det.r.ined fro. eo.p.rison* to • reduced proin*ulin 
stenderd curve penereted in the *.-. •»* fro. either bovine 
prointulin or been proin*ulin S-iulfonete. 

The proin*ulin-S-*ulfon.te we* el*o ****yed by HPLC. In 
fip. 4 profile* «re *hown for '-tulfoneted Bovine prointulin. 
h.cteriel derived hu.en proinsulin end e eonbinetion of the 
two. in ti:is enelyti*. *eop1e* of bovine, end hu.e« prointulin 
tulfonete end e nieture of the two were epplied to e 10 . 

RP-189 coluon end eluted utinp e lineer pr.dient of 21 to 33 
percent n-propenol end eeetonitrile (2:1) in SO .M 4 MH 0*e 
(pH7). The protein* ere teen to run very neerly coincident. 

The lerpe * 278 peek et the end of the ehronetopre. i* due to 
repid chenpe* in the solvent co.potition end not eluted protein. 

8. preperetion of Prointulin Anelop 

Oetcribed below 1* the tyntneti* of * pene which codes for 
the express ion of en enelop of prointulin co.prisinp the A end 
8 cheint of hu.en intulin connected by e brldpinp chein which 
differ* fro. the C chein of hu.en prointulin in thet it 
contein* only e.ino ecid unit* rether the- the 3S unit 
polypeptide of hu.en prointulin shown in rip.l. Specifieelly 
the « unit* ere. re.dinp in order fro. the lett unit of the 8 
eh.in to the A chein. Arp-Arp-Cly-Ser-lyt-Arp. Thi* sequence 
end sequence* Arp-Arp end lyt-*rp e* does hu.en 


het the teoe 
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proinsulin thus permitting eicission of the bridging chain by 
proteolytic means. 

A cnoin of 6 aaino acids Is aa acceptable leagtb for a 
•edified (or analog) bridging *C* chain which will permit 
folding and the subsequent formation of the disulfide 
crosslinks between A and B chains characteristic of hormone 
Insulin. However, those skilled in the art will appreciate 
tnat bridging chains shorter or longer than S would also be 
useful as well by permitting folding and the foroatiqn of the 
necessary disulfide bonds. Sequences of 100 or even aore aaino 
acid units can b« employed in the bridging chain. However, the 
practical difficulty of obtaining gene fragnents coding for 
*er y long sequences nakes the bribing chain analog' of few^r 
than 35 aaino acid units aore attractive from a practical point 
of view. 

The ends of the bridging chain, no aatttr how aany 
intermediate aaino acid units or in what order, aust be 
constructed to perait excission of the bridging chain. 

Although alternative neans aay be enproyed. we prefer to use 
the sequences Arg-Arg and Arg-lys as found in proinsulln itself 
as proteolytic cleavage using trypsin and carboxypeptidase B 
occurs cleanly at these sites. 

1. Preparation of Synthetic Gene Coding for the 57 Aaino 
Acids of an Analog Proinsulin u 

a. Oligonucleotide Synthesis 

The chemical synthesis methods as well as the 
synthesis of the DMA gene fragments coding for the A and B 
chains of huoan insulin have been described, k. Itakura 
et * 1 .. j. aiol. Chen. . 250. *592 (197S). 1. Itakura ii *1- ii 
Biol. Chao .. 2SO . *S92 (1975). t. Itakura et aK J. Aa. Chen. 
Soe.. 97. 7327 (1975). Crea et aK Proc. Hat. Acad. Sci.. USA, 
7S, 5765 ( 1978) and Goeddel et a_l_. Proc. Nat. Acad. Sci. . USA. 
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H. 106 (1979). five »ew oligonucleotide fregaeacs-were 
synthesited 0/ ilsttir aethods also using the triester process 
described la the above cited refereaces. These sequences «re 
shown below ia Teblo 4 ead flj. 9. 


TABLE 4 


Oligonucleotides 


C 

C 

C 

C 

C 


1 

2 

3 

< 

S 


AAGACTCGTCGTC 

GATCtAAStGTCGCATC 

CATCCACCACGAGTCTT 

CAACGATGCCACGCTTG 

TCSACTATTAGTT 


b. Joining of Synthetic Oligonucleotides 

figure S shows the synthetic oligonucleotides of 
the insulin A end B ehein genes previously prepered end the 
aenner in which the new fregaents Cj-Cj were used In the 
enzyaetic construction of e coaplete gene for e proinsulin 
enelog. The scheoe for obteining this gene is set forth in 
Fig. 6. 

A plesaid pCHl conteining the left fcjlT : of the B 
chein gene wes used in this process end is described by Goeddel 
e t «l_., Proc. Met. Aced. Sci.. USA. 76. 106 (1979). It 
contains the codon* for the 1-13 eaino acids of the 8 chein end 
e aethiontne unit et the B-terainel which will be used later to 
cleave the proinsulin enelog froa the becterielly expressed 
chleeric protein using cyanogen broaide (CKBr). 

The gene for the right half of the 9 chain wet 
obtained froa the oligoruclaotldo* -Bj.-dg. Bj. Cj, 

e 6 end e 7 . Bj. B, end Cj (C, end Cj replacing me 
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• 5 and l| Q sequences In the previously prepared peat frep. 
seat) by ligation using T* lipase ia conventional techniques. 

This peae frapaeat codes for the l*-30 aatao acid entts of the 
P chain aad the first two onlts Arg-Arg of the brldpfnp 
(aotflfled *C*) chain-pPCl. 

After perlflcation by polyacrylaalde pel electro¬ 
phoresis and elution of the larpest OMA band the frapaeat was 
inserted into the plasaid pBR322 that had been cleared with 
restriction endonucleases hiadl ll aad Parti, thereby utiliziap 
the Hiadl ll and Part i sites oa the peae frapaeat. aad clone* <• 
C. coll 29* (ATCC io. 3U46). The plasaid pBC recovered froa 
an aapicillin-resistaat. tetracycline sensitive clone possessed 
the desired nucleotide sequence according to the aethod of A.H. 
Haiaa et aK. Proc. Hat. Acad. Sci.. USA, 7* . S60 (1977). 

The A gene was constructed siailarly froa 
oligonucleotides Cj, *j» A 4 , Aj, Aj, C 4 . 

A 8 . a,. A l0 . Aj, anc C s (Cj. c 4 and C $ 

replacing tne Aj. Aj and A J2 sequences in the previously 
prepared gene fragaent) using T* ligase In conventional 
techniques. This fragaent codes for the 21 aaino acids of the 
A and the four units of the bridging C chain. Cly-Ser-iys-Arg. 
After purification. also b, polyacrylaalde electrophoresis, the 
fragaent was inserted into the plasaid pS2322 which had oeen 
cleaved with restriction endonucleases CcoRI and SaM using the 
CcoRI and Sai l sites on the fragaent and cloned in £. col I 
294. An aapicill<n-resistant, tetracydine-sensit1*e clone 
yielded plasaid pCAlP having the desired nucleotide sequence Dy 
the aethod of A.V. Kaiea et aK . supra . 

2. Construction of the Proinsulin Analog Gene and 
Corresponding Expression Plasaid 

The desireC expression plasaid was prepared froa 
plasaids pEHl. prc 1 and pCA18 as sho-n ir. fig. 6. The plasaid 
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p»Hl w.s cle***< with mod Ul *nd 1 19 .tee to too fr.gae«t BC 
•ocKod fro. pl.s.id pBCt 9y tre.t.ent -Ito jModlll .nd B**U • 

Too resattl *9 pl.s.H P«C»5 «»••••« *‘ lh i£l #l *" 4 

to to Ecoil fr.poeot of p 1 *«S *»«eb eoototos toe toe csotrol 
,„to. ..d to. ••Jorttjr of to. p.«ol«cusfM>o straet.r.l 9«« 
td.sl 9 n.ted ». Itokar. £t 11- XrUoeo. IU. 10S6 t!9f7). 
tots ll*.tfo. produced pl.s.id P»»* -«co -os d«...d -««• 
p^,. Siil ..d .Ifcol1o* poospootoso. To. product of tOI. 

-•* tf 9 »t«d to fr. 9 **ot CA ..Cls.d fro. pt.s.ld pCA18 
b, tre.tnent with B£«l ..d SHI « Shown in F«9- « «« 
foro.C into £. eoU 29* .nd cloned. To. pl.s.id pBCAS -.s 
recovered fro. ..pielll In-reslst.nt. tetr.eycl ine-sencit 1 »e 
clones 10 £. eoU 29* grown on 1-9*1 P>*t« cont.lnln, 

..pielllin .»* cont.ln.d toe DBA coding sequence for toe 
prolnsul in * 0.109 • » mulcted By toe ..tood of A.K. «**•• 
et *!•. suor*. 

j. E.presslon of Prolnsulso An*to9 

Toe fully co.r.cterfzed pl*s.1d pBCAS *«s Inserted into 
E. eoH 8 T JOB *nd grown 1r four-liter flists eont.ining l.S 
liter LB eont.ining 29 -gfl ..pieilll". B.cor.red cells (322 9 
-et) -ere lysed Oy sonic.tlon In two liters of 10 percent 
sucrose. SO ■* EOT*. 0.1 H trfs/HCl. pH 7.9. 0.1 N 

-Renyl!»«tnylsul fonyl fluoride. 0.2 H H.C1 . .nd 1 nil 

1 , 3 -dltnio- 2 -prop.nol. After c.ntrlfug.tion (30 • <». SOOO rpo) 
toe pellet «•* *«*P»»«*« Stirring in .90 .1 7 h gu.nldine 
Hydrochloride. 0.1 oH. 1.J-dithf0-2-prop.nol. 1 m* EOT*. Tols 
suspension w.s centrlfujed (30 .in. 12.000 rp.) .nd the 
supern.t.nt diluted si.-fold Into cold w.ter. The 
preclpit.tlon p.otein (12.* 9 dry -.i ? *t) collected »y 

centrlfvg.tion (20 .*n. SOOO rp.) «nd treeted overnight «t »T 
with 2.8 9 (26.* -ol) CHBr in 200 .1 SB percent for.ic .ci< to 
Cle.ve the proinsulin .n.log (hereinafter *en» 1 og--C-- 
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proiasolIn) froa the a-galactosidase residues at toe 
S-aethfonine unit. After rotary traporatioo to dryness at 
•■der 30*C, voter was added and the voluae again reduced. Tbe 
reside? was suspended ie 300 al 5 H geaoidlne Hydrochloride and 
the pH adjested to 9.0 with ethanolaaiae. To this was added 
1S.0 g sodiea sulfite and 7.5 g sodiea tetratbionatc to convert 
cysteines tod cystines to cysteine S-sulfonate groups and the 
aixtore allowed to react at rooo teaperatare for six hours. 

The reaction aixtore eas exhaustively dialyzed (Spectropor 3) 
against 1 an COTA at **C aod the precipitated protein 
S-solfonates collected by centrifugation {20 afn, 5000 rpa). 

The pellet was suspended in 50 nl 7.5 oK Tris/HCl, pH 7.5, 
filtered and loaded onto a 9C-S2 coluan (2.5 x 87 c»i in the 
saae buffer at 4*C. The coluan was eluted with a linear 
gradient of 0 to 0.5 H UaCl in the sane buffer. The presence 
of nini-C proinsulin S-sulfontte was detected by insulin RIA 
analyses described below to elute at the end of the 00 peat. 

The pooled fractions were applied to a G-50F Sephadex coluan 
(2.5 x 100 cn) and eluted with. 7 H urea, 50 nM trls/HCi. pH 
7.5, i aH COTA, fractions containing ;*isul In-RIA-active 
•aterlal uere pooled and dialyzed against 20 aM anaeniun 
carbonate, pH 8.8, and lyophiltxed. The white powder was 
resuspended in 7 al 20 aM aaaoniua carbonate, pH 8.8, and 
stored at -40*C. A portion of the product pool was further 
purified by preparative HPlC and the insulin RlA-active peak 
analyzed for aaino-acid content which is shown In Table 5. 

The anal 09 -"C* proinsulfn S-sulfonate was purified by 
HPLC by prep-col 1ection froa an analytical C-8 coluan. The 
resolving gradient «as 21-39 percent 2-propanol in 50««H MaOAc, 
pH 7.0, at 0.6 percent/ain and 2 al/ain. Up to 1.5 aol- of 
protein solution was resolved in one run by three successive 
injections via a 500 «1 loop. 
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TABlC S 

AaUo Add Analysis of l«-ho.r 6» HC1 hydrolysis CllO*) of 
purified en.109--C* proinsulln. Cysteine -as quantitated by 
teperote deter.lnotlon of eysteic odd on perfor.ic odd 
osidized.to-ple ond calculated by eysteic/oionine rotio. 
yol.es increased to co.pe»sate for ocid deco-position -ere 
rorin. CIO percent) and threonine (5 percent). -.0. - 
deter 


Aaino Add 
predicted 
Ale 
Ar9 
Asp 
Cys/2 
G1 u 
Cly 
His 
T.e 
leu 

iy* 

net 

Phe 

Pro 

Ser 

Thr 

Trp 

Tyr 

V<1 


aa pole percent x 57 


1.19 
4.06 
3.17 
4.87 
6.95 
5.37 
2.03 
1 .39 
5.98 
2.02 
0 

3.03 
1.96 
4.21 
3.09 
N.O. 
3.99 


awl no acids 


1 

4 
3 
6 
7 

5 
2 
2 

6 
2 
0 

3 
1 

4 

3 
0 

4 


3.74 


4 
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4. Folding Of Analog *C* Prolnsulin 

Analog *C* prolnsulin folding to obtain a crosslinked 
fora wot accomplished by reaction of A ag of protoin containing 
about JO percent analog *C* prolnsulin tolfonatc. Tbo protein 
was dissolved 1n-a degassed buffer of 40 aN glycine. pH 10.6. 3 
^ grea. 0.3 H HaCl at 0 . To this was added »-aercaptoetbar.ol 
to a concentration of 6.4 aM and the reaction sealed under 
hj. The course of the reaction was followed by aeasoring the 
Increase in RIA activity and was coaplete within about four 
hoars. The reaction was stopped by the addition of acetic acid. 

The reaction aizture was purified by HPIC by 
prep-collection froa a C-18 Ultrosphere column. The resolving 
gradient was 21-28 percent acetonitrile in 0.2a aaaoniun 
sulfate. 50 aM HaOAc. pH 4.0. at 0.2 percent/ain and 1.0 al/ain. 

S. Assay For Analog *C* Prolnsulin 

Analog *C* prolnsulin has no crossreactivity in the 
insulin radioiaauno assay as the S-sulfonate. However, the 
foraation of analog *C* prolnsulin a; an expression product was 
conflraed by crossreactivity of the thiol fora. The samples of 
unknowns were treated for two ainutes at 90*C with 1 n»M 
a-aercaptocthanol. diluted into R1A buffer (0.1 H sodium 
phosphate. pH 7.4. 0.1SM NeCl. 0.1 percent NaNj. 0.1 percent 
gelatin) and immediately assayed. Reproduceability depends 
upon strict timing since extended incubation of the di’uted. 
reduced test solution leads to variable oxidative folding of 
the molecule into forms with higher RIA activity. 

The thiol form gave an activity of 0.9 percent compared 
to insulin’s 100 percent. By comparison, oovine prolnsulin had 
RIA activity of 7.3 percent of insulin activity and the tniol 
form of bovine proinsulin had an activity of 1.9 percent. The 
third fora of the 8 chain of porcine prolnsulin had activity of 
0.1 percent. On incubation with a slight excess of 
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s-aercaptoethanol to obtain the folded. crosslinked product, 
reaction aUtures thou BIA activity of 20-40 percent thet of 

IasuHa. ^ 

C. Preparation of Hunan Insulin 

l. Folding end Unking of A end » Chelns. 

The hunen prolnsulln or enelogt thereof, prepered in 
tccordence with the present invention. for esenple. following 
the procedures of Ports A or • above. es their respective 
S-sulfonetes ere induced to fold with proper fornetion of 
internet disulfide bonds (between cysteine ? B end cysteine 
; A, between cysteine l9 B ena cysteine 20 A. end between 
cysteine^ n A) by neons of controlled sulfhydroxyl 
interch.nge catalyzed by s-nereaptoe*.'enol. 

To e 0.1 ng/nl solution of proinsulin S-sulfonate in 
degessed SO aM sodiua glycinete. pH 10.6, et «*C was edded 
i-nerceptoethenol to e fine! concentration of 0.3 -M. After 
four hours, the reaction Is essentially coaplete as neasured by 
the increase in cross-reacting activity of the oisture in 
Insulin RJA. The yield of proinsulin is about 80 percent. 
Proinsulin is tnen purified froa side products by gel 
peraeation. ion escnange. end/or reverse phase high pressure 
liquid chroaatogrephy to yield product in substantially 
purified fora. 

2. Excision of Bridging Chain 

The huoar. proinsulin or analogs thereof, prepared in 
accordance with the present invention, for esaeple. following 
the procedure of Pert C. 1. ebove. are proteolytically convetted 
to insulin for exaaple in accordance with the procedure of 
Renal er et •!... J - Biol Chea. . 246. 6786 (19U1. The obtained 
insulin is then purified by eoluan chromatography or zinc 
crystal 1liation to yield product .n substantially 
fora, identical to natural huuan insulin ana freed of 
piologicelly active contaminants. 
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While the Uvta»loa la its aost preferred eabodlaent is 
described with refereace to C. coli t other alcrorganisas could 
likewise scree as host cells, for exaapte, /tests such as 
Saeeharoayeci crrtTilUc* IdCilli soch as lacings cad 

preferably other eaterobacteriaceae aaong which acy be 
aeotloaed as exaaples SalaoaeJla teehlanriua aad StTtAllk 
■irfMiai . utilizing plasaids that caa replicate aad express 
heterologous geae sequeaces la these orgaaisas. The 
faveatior Is aot to be Halted to the preferred eabodiaents 
described. 
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CLAIMS: 

l. A chimeric polypeptide comprising: 

a) the polypeptide sequence of a pcoinsulin comprising . 
the * end B chains of human insulin connected by a bridging 
chain of at least 2 .amino acid units, said bridging chain 
having sites at each end which permit its excision from 

between said A and B chains; and 

b) an additional protein or protein fragment ; 
there being a cleavage site at or adjacent said additional 
orotein or fragment and adjacent one end of the polypeptide 

* m 

sequence of said proinsulin.. 

2. A chimeric polypeptide according to claim 1 wherein the 
amino acid sequence of the bridging chain corresponds to that 
of the C peptide of human proinsulin. 

3. A chimeric polypeptide according to claim 1 wherein the 
amino acid sequence of the bridging chain is Arg-Arg-Cly-Ser- 

Lys-Arg. 

4. A chimeric polypeptide according to claim 1 wherein the 
amino acid sequence of the bridging chain is Arg-Arg. 

5 . A chimeric polypeptide according to claim 1 or claim 2 
wherein the sites permitting excision of the bridging chain 
are the amino acid units Arg-Arg at the B chain end and Lys- 
Arg at the A chain end. 
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6 . A chimeric polypeptide according to any one o£ claims 1 
to 5 wherein said cleavage site is a methionine unit. 

7 . A chimeric polypeptide according to claim 6 wherein the 
methionine unit is adjacent the N-terminal of said proinsulin. 

8 . A chimeric polypeptide according to any one of claims 1 
to 7 wherein the additional protein or fragment is either 

a) at least a substantial portion of S-galactosidase; or 

b) a fragment of the trp leader polypeptide fused to a 
portion of the trp E polypeptide. 

9 . A process of producing a chimeric polypeptide according 
to claim 1 comprising the steps: 

1 ) inserting a gene coding for said proinsulin into a 
nicrobial cloning vehicle in which the gene is in reading 
phase with a DNA sequence coding for said additional protein 
or fragment comprising said cleavage site; 

2 ) transforming said cloning vehicle containing said 
inserted gene into a microbial host for expression of said 
chimeric polypeptide; 

3 ) expressing the chimeric polypeptide; and 

«) isolating the expressed chimeric polypeptide. 

10. A process for producing a proinsulin comprising the 
process of claim 9 and the additional step of cleaving said 
chimeric polypeptide to release said proinsulin. 
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chain froa said proinsulin. 

11. A process according to claia 10 wherein 

the excision of the bridging chain is preceded by the 
formation of disulfide bonds between the A and B 
chains and the product of excision is human insulin. 

12 . A process according to any one of the preceding 
claims wherein step (2) affords a viable culture of 
microbial transformants containing the cloning vehicle, 
which culture is cultivated to perform step (3), wherein 
step (4) comprises separating the resulting cellular 
mass; and isolating a said chimeric polypeptide from it. 


13. A method of producing human insulin comprising: 

1 ) cultivating a viable culture of microbial 
transformants containing a cloning vehicle suited for 
transformation of a microbial host and use therein for 
expressing a chimeric polypeptide comprising: 

a) the polypeptide sequence of a proinsulin 
comprising the A and B chains of human insulin connected 
by a briding chain of at least 2 amino acid units, said 
bridgirg chain having sites at each end which permit 
its excision from between said A and B chains; and 

b) an additional protein or protein fragment; 
there ieing a cleavage site at or adjacent said 
additional protein or fragment and adjacent one end of 
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th® polypeptide sequence of said proinsulin; 

2 ) separating the resulting cellular mass; 3) isolating 
the precursors to human insulin comprising the chimeric 
polypeptide; 4) cleaving the additional protein 
therefrom; S) effecting folding and linkage of the 
A and B chains; and 6) excising the bridging chain. 

14. A.method according to claim 13 wherein the cloning 
vehicle is plasmid pH17. 

ISA method according to claim 13 wherein the cloning 
vehicle is plasmid pBCAS. 
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